Glottal Waveform Model for Oesophageal Speech

نویسنده

  • John M. O’ Toole
چکیده

Oesophageal speech, a mode of speech for laryngectomees, is of low quality and intelligibility comparative to normal (laryngeal) speech. Understanding the signal differences between oesophageal and normal speech will help future oesophageal speech enhancement methods. We aim to produce a method to synthesise oesophageal speech using a simple source–filter model. In this paper, we fit a parametric glottal waveform model to speech samples in our oesophageal database. (This glottal waveform represents the source component in the source–filter approach.) We added coloured noise to the glottal waveform model to produce realistic sounding oesophageal speech. Our fitting error measure, a spectral distance measure, reduces for all tested speech samples when adding the coloured noise. Yet missing from our synthesised signal is the rough-sounding quality often present in oesophageal speech. This work represents the first steps in developing a method to synthesise oesophageal speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis

Recent studies have shown that text-to-speech synthesis quality can be improved by using glottal vocoding. This refers to vocoders that parameterize speech into two parts, the glottal excitation and vocal tract, that occur in the human speech production apparatus. Current glottal vocoders generate the glottal excitation waveform by using deep neural networks (DNNs). However, the squared error-b...

متن کامل

Improving Glottal Waveform Rank-based Glottal Qua

Information on the glottal waveform is an important part of many speech applications. However, glottal waveform estimation remains one of the more inexact sciences of speech processing. The work presented here describes an enhancement to a recently presented algorithm by a new technique involving Rank-Based Glottal Quality Assessment (RB-GQA). The basic premise is to investigate potential measu...

متن کامل

Segmentation of a speech waveform according to glottal open and closed phases using an autoregressive-HMM

This paper presents an algorithm to segment speech according to glottal open and closed phases using the time waveform alone. Based on this, pitch, jitter and closed to open glottal ratios can be computed. Segmentation is achieved by identifying spectral changepoints at the subpitch period timescale. Changepoints are identi ed using a 3-state autoregressive hidden Markov model (AR-HMM) operatin...

متن کامل

Voice source analysis using biomechanical modeling and glottal inverse filtering

This paper studies the use of glottal inverse filtering together with a biomechanical model of the vocal folds to simulate the glottal flow waveform. The glottal flow waveform is first estimated by inverse filtering the acoustic speech pressure signal of natural speech. The estimated glottal flow is used as a template in an optimization process which searches for a set of parameters for a deter...

متن کامل

Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs

Glottal volume velocity waveform, the acoustical excitation of voiced speech, cannot be acquired through direct measurements in normal production of continuous speech. Glottal inverse filtering (GIF), however, can be used to estimate the glottal flow from recorded speech signals. Unfortunately, the usefulness of GIF algorithms is limited since they are sensitive to noise and call for high-quali...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012